Overview
Brought to you by YData
Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 1716428 |
| Missing cells | 3939947 |
| Missing cells (%) | 13.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 448.3 MiB |
| Average record size in memory | 273.9 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 3 |
AMT_CREDIT_SUM_DEBT is highly overall correlated with DAYS_CREDIT_ENDDATE and 1 other fields | High correlation |
AMT_CREDIT_SUM_OVERDUE is highly overall correlated with CREDIT_DAY_OVERDUE | High correlation |
CREDIT_DAY_OVERDUE is highly overall correlated with AMT_CREDIT_SUM_OVERDUE | High correlation |
DAYS_CREDIT is highly overall correlated with DAYS_CREDIT_ENDDATE and 2 other fields | High correlation |
DAYS_CREDIT_ENDDATE is highly overall correlated with AMT_CREDIT_SUM_DEBT and 3 other fields | High correlation |
DAYS_CREDIT_UPDATE is highly overall correlated with AMT_CREDIT_SUM_DEBT and 3 other fields | High correlation |
DAYS_ENDDATE_FACT is highly overall correlated with DAYS_CREDIT and 2 other fields | High correlation |
CREDIT_ACTIVE is highly imbalanced (50.9%) | Imbalance |
CREDIT_CURRENCY is highly imbalanced (99.5%) | Imbalance |
CREDIT_TYPE is highly imbalanced (72.7%) | Imbalance |
DAYS_CREDIT_ENDDATE has 105553 (6.1%) missing values | Missing |
DAYS_ENDDATE_FACT has 633653 (36.9%) missing values | Missing |
AMT_CREDIT_MAX_OVERDUE has 1124488 (65.5%) missing values | Missing |
AMT_CREDIT_SUM_DEBT has 257669 (15.0%) missing values | Missing |
AMT_CREDIT_SUM_LIMIT has 591780 (34.5%) missing values | Missing |
AMT_ANNUITY has 1226791 (71.5%) missing values | Missing |
CREDIT_DAY_OVERDUE is highly skewed (γ1 = 55.93100542) | Skewed |
AMT_CREDIT_MAX_OVERDUE is highly skewed (γ1 = 470.9138195) | Skewed |
CNT_CREDIT_PROLONG is highly skewed (γ1 = 20.31927659) | Skewed |
AMT_CREDIT_SUM is highly skewed (γ1 = 124.5860969) | Skewed |
AMT_CREDIT_SUM_DEBT is highly skewed (γ1 = 36.41453834) | Skewed |
AMT_CREDIT_SUM_OVERDUE is highly skewed (γ1 = 403.2418584) | Skewed |
AMT_ANNUITY is highly skewed (γ1 = 212.5431248) | Skewed |
SK_ID_BUREAU has unique values | Unique |
CREDIT_DAY_OVERDUE has 1712211 (99.8%) zeros | Zeros |
AMT_CREDIT_MAX_OVERDUE has 470650 (27.4%) zeros | Zeros |
CNT_CREDIT_PROLONG has 1707314 (99.5%) zeros | Zeros |
AMT_CREDIT_SUM has 66582 (3.9%) zeros | Zeros |
AMT_CREDIT_SUM_DEBT has 1016434 (59.2%) zeros | Zeros |
AMT_CREDIT_SUM_LIMIT has 1050142 (61.2%) zeros | Zeros |
AMT_CREDIT_SUM_OVERDUE has 1712270 (99.8%) zeros | Zeros |
AMT_ANNUITY has 256915 (15.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-23 10:16:08.927867 |
|---|---|
| Analysis finished | 2025-08-23 10:19:31.465641 |
| Duration | 3 minutes and 22.54 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
SK_ID_CURR
Real number (ℝ)
| Distinct | 305811 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 278214.93 |
| Minimum | 100001 |
|---|---|
| Maximum | 456255 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | 100001 |
|---|---|
| 5-th percentile | 117919 |
| Q1 | 188866.75 |
| median | 278055 |
| Q3 | 367426 |
| 95-th percentile | 438632 |
| Maximum | 456255 |
| Range | 356254 |
| Interquartile range (IQR) | 178559.25 |
Descriptive statistics
| Standard deviation | 102938.56 |
|---|---|
| Coefficient of variation (CV) | 0.36999652 |
| Kurtosis | -1.2027765 |
| Mean | 278214.93 |
| Median Absolute Deviation (MAD) | 89275 |
| Skewness | 0.0010628877 |
| Sum | 4.775359 × 1011 |
| Variance | 1.0596347 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 120860 | 116 | < 0.1% |
| 169704 | 94 | < 0.1% |
| 318065 | 78 | < 0.1% |
| 251643 | 61 | < 0.1% |
| 425396 | 60 | < 0.1% |
| 295809 | 59 | < 0.1% |
| 129843 | 58 | < 0.1% |
| 385133 | 57 | < 0.1% |
| 177014 | 56 | < 0.1% |
| 280155 | 55 | < 0.1% |
| Other values (305801) | 1715734 |
| Value | Count | Frequency (%) |
| 100001 | 7 | < 0.1% |
| 100002 | 8 | |
| 100003 | 4 | < 0.1% |
| 100004 | 2 | < 0.1% |
| 100005 | 3 | < 0.1% |
| 100007 | 1 | < 0.1% |
| 100008 | 3 | < 0.1% |
| 100009 | 18 | |
| 100010 | 2 | < 0.1% |
| 100011 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 456255 | 11 | |
| 456254 | 1 | < 0.1% |
| 456253 | 4 | < 0.1% |
| 456250 | 3 | < 0.1% |
| 456249 | 13 | |
| 456247 | 11 | |
| 456246 | 3 | < 0.1% |
| 456244 | 23 | |
| 456243 | 7 | < 0.1% |
| 456242 | 1 | < 0.1% |
SK_ID_BUREAU
Real number (ℝ)
Unique 
| Distinct | 1716428 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5924434.5 |
| Minimum | 5000000 |
|---|---|
| Maximum | 6843457 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | 5000000 |
|---|---|
| 5-th percentile | 5092405.3 |
| Q1 | 5463953.8 |
| median | 5926303.5 |
| Q3 | 6385681.2 |
| 95-th percentile | 6751974.7 |
| Maximum | 6843457 |
| Range | 1843457 |
| Interquartile range (IQR) | 921727.5 |
Descriptive statistics
| Standard deviation | 532265.73 |
|---|---|
| Coefficient of variation (CV) | 0.089842453 |
| Kurtosis | -1.1990158 |
| Mean | 5924434.5 |
| Median Absolute Deviation (MAD) | 460849.5 |
| Skewness | -0.0074978321 |
| Sum | 1.0168865 × 1013 |
| Variance | 2.8330681 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5714462 | 1 | < 0.1% |
| 6758530 | 1 | < 0.1% |
| 6758496 | 1 | < 0.1% |
| 6758495 | 1 | < 0.1% |
| 6758494 | 1 | < 0.1% |
| 6758493 | 1 | < 0.1% |
| 6758492 | 1 | < 0.1% |
| 6758491 | 1 | < 0.1% |
| 6758490 | 1 | < 0.1% |
| 6758489 | 1 | < 0.1% |
| Other values (1716418) | 1716418 |
| Value | Count | Frequency (%) |
| 5000000 | 1 | |
| 5000001 | 1 | |
| 5000002 | 1 | |
| 5000003 | 1 | |
| 5000004 | 1 | |
| 5000005 | 1 | |
| 5000006 | 1 | |
| 5000009 | 1 | |
| 5000010 | 1 | |
| 5000011 | 1 |
| Value | Count | Frequency (%) |
| 6843457 | 1 | |
| 6843456 | 1 | |
| 6843455 | 1 | |
| 6843454 | 1 | |
| 6843453 | 1 | |
| 6843452 | 1 | |
| 6843451 | 1 | |
| 6843450 | 1 | |
| 6843447 | 1 | |
| 6843446 | 1 |
CREDIT_ACTIVE
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 103.1 MiB |
| Closed | |
|---|---|
| Active | |
| Sold | 6527 |
| Bad debt | 21 |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 5.9924191 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Closed |
|---|---|
| 2nd row | Active |
| 3rd row | Active |
| 4th row | Active |
| 5th row | Active |
Common Values
| Value | Count | Frequency (%) |
| Closed | 1079273 | |
| Active | 630607 | |
| Sold | 6527 | 0.4% |
| Bad debt | 21 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| closed | 1079273 | |
| active | 630607 | |
| sold | 6527 | 0.4% |
| bad | 21 | < 0.1% |
| debt | 21 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1709901 | |
| d | 1085842 | |
| l | 1085800 | |
| o | 1085800 | |
| C | 1079273 | |
| s | 1079273 | |
| t | 630628 | 6.1% |
| A | 630607 | 6.1% |
| c | 630607 | 6.1% |
| i | 630607 | 6.1% |
| Other values (6) | 637218 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10285556 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1709901 | |
| d | 1085842 | |
| l | 1085800 | |
| o | 1085800 | |
| C | 1079273 | |
| s | 1079273 | |
| t | 630628 | 6.1% |
| A | 630607 | 6.1% |
| c | 630607 | 6.1% |
| i | 630607 | 6.1% |
| Other values (6) | 637218 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10285556 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1709901 | |
| d | 1085842 | |
| l | 1085800 | |
| o | 1085800 | |
| C | 1079273 | |
| s | 1079273 | |
| t | 630628 | 6.1% |
| A | 630607 | 6.1% |
| c | 630607 | 6.1% |
| i | 630607 | 6.1% |
| Other values (6) | 637218 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10285556 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1709901 | |
| d | 1085842 | |
| l | 1085800 | |
| o | 1085800 | |
| C | 1079273 | |
| s | 1079273 | |
| t | 630628 | 6.1% |
| A | 630607 | 6.1% |
| c | 630607 | 6.1% |
| i | 630607 | 6.1% |
| Other values (6) | 637218 | 6.2% |
CREDIT_CURRENCY
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 109.7 MiB |
| currency 1 | |
|---|---|
| currency 2 | 1224 |
| currency 3 | 174 |
| currency 4 | 10 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | currency 1 |
|---|---|
| 2nd row | currency 1 |
| 3rd row | currency 1 |
| 4th row | currency 1 |
| 5th row | currency 1 |
Common Values
| Value | Count | Frequency (%) |
| currency 1 | 1715020 | |
| currency 2 | 1224 | 0.1% |
| currency 3 | 174 | < 0.1% |
| currency 4 | 10 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| currency | 1716428 | |
| 1 | 1715020 | |
| 2 | 1224 | < 0.1% |
| 3 | 174 | < 0.1% |
| 4 | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 3432856 | |
| r | 3432856 | |
| u | 1716428 | |
| e | 1716428 | |
| n | 1716428 | |
| y | 1716428 | |
| 1716428 | ||
| 1 | 1715020 | |
| 2 | 1224 | < 0.1% |
| 3 | 174 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17164280 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 3432856 | |
| r | 3432856 | |
| u | 1716428 | |
| e | 1716428 | |
| n | 1716428 | |
| y | 1716428 | |
| 1716428 | ||
| 1 | 1715020 | |
| 2 | 1224 | < 0.1% |
| 3 | 174 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17164280 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 3432856 | |
| r | 3432856 | |
| u | 1716428 | |
| e | 1716428 | |
| n | 1716428 | |
| y | 1716428 | |
| 1716428 | ||
| 1 | 1715020 | |
| 2 | 1224 | < 0.1% |
| 3 | 174 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17164280 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 3432856 | |
| r | 3432856 | |
| u | 1716428 | |
| e | 1716428 | |
| n | 1716428 | |
| y | 1716428 | |
| 1716428 | ||
| 1 | 1715020 | |
| 2 | 1224 | < 0.1% |
| 3 | 174 | < 0.1% |
DAYS_CREDIT
Real number (ℝ)
High correlation 
| Distinct | 2923 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1142.1077 |
| Minimum | -2922 |
|---|---|
| Maximum | 0 |
| Zeros | 25 |
| Zeros (%) | < 0.1% |
| Negative | 1716403 |
| Negative (%) | > 99.9% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | -2922 |
|---|---|
| 5-th percentile | -2665 |
| Q1 | -1666 |
| median | -987 |
| Q3 | -474 |
| 95-th percentile | -125 |
| Maximum | 0 |
| Range | 2922 |
| Interquartile range (IQR) | 1192 |
Descriptive statistics
| Standard deviation | 795.16493 |
|---|---|
| Coefficient of variation (CV) | -0.69622588 |
| Kurtosis | -0.73544529 |
| Mean | -1142.1077 |
| Median Absolute Deviation (MAD) | 570 |
| Skewness | -0.58234905 |
| Sum | -1.9603456 × 109 |
| Variance | 632287.26 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -364 | 1330 | 0.1% |
| -336 | 1248 | 0.1% |
| -273 | 1238 | 0.1% |
| -357 | 1218 | 0.1% |
| -343 | 1203 | 0.1% |
| -315 | 1202 | 0.1% |
| -371 | 1196 | 0.1% |
| -365 | 1194 | 0.1% |
| -210 | 1193 | 0.1% |
| -245 | 1192 | 0.1% |
| Other values (2913) | 1704214 |
| Value | Count | Frequency (%) |
| -2922 | 278 | |
| -2921 | 283 | |
| -2920 | 317 | |
| -2919 | 344 | |
| -2918 | 329 | |
| -2917 | 292 | |
| -2916 | 296 | |
| -2915 | 299 | |
| -2914 | 317 | |
| -2913 | 317 |
| Value | Count | Frequency (%) |
| 0 | 25 | < 0.1% |
| -1 | 17 | < 0.1% |
| -2 | 42 | < 0.1% |
| -3 | 74 | < 0.1% |
| -4 | 113 | < 0.1% |
| -5 | 146 | |
| -6 | 184 | |
| -7 | 251 | |
| -8 | 319 | |
| -9 | 298 |
CREDIT_DAY_OVERDUE
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 942 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.81816656 |
| Minimum | 0 |
|---|---|
| Maximum | 2792 |
| Zeros | 1712211 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2792 |
| Range | 2792 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 36.544428 |
|---|---|
| Coefficient of variation (CV) | 44.666245 |
| Kurtosis | 3374.4841 |
| Mean | 0.81816656 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 55.931005 |
| Sum | 1404324 |
| Variance | 1335.4952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1712211 | |
| 30 | 311 | < 0.1% |
| 60 | 126 | < 0.1% |
| 8 | 103 | < 0.1% |
| 13 | 103 | < 0.1% |
| 9 | 93 | < 0.1% |
| 7 | 92 | < 0.1% |
| 14 | 91 | < 0.1% |
| 17 | 77 | < 0.1% |
| 11 | 75 | < 0.1% |
| Other values (932) | 3146 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 1712211 | |
| 1 | 5 | < 0.1% |
| 2 | 18 | < 0.1% |
| 3 | 29 | < 0.1% |
| 4 | 46 | < 0.1% |
| 5 | 51 | < 0.1% |
| 6 | 59 | < 0.1% |
| 7 | 92 | < 0.1% |
| 8 | 103 | < 0.1% |
| 9 | 93 | < 0.1% |
| Value | Count | Frequency (%) |
| 2792 | 1 | |
| 2781 | 1 | |
| 2776 | 1 | |
| 2770 | 1 | |
| 2766 | 1 | |
| 2765 | 1 | |
| 2754 | 1 | |
| 2703 | 1 | |
| 2700 | 1 | |
| 2693 | 1 |
DAYS_CREDIT_ENDDATE
Real number (ℝ)
High correlation  Missing 
| Distinct | 14096 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 105553 |
| Missing (%) | 6.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 510.51736 |
| Minimum | -42060 |
|---|---|
| Maximum | 31199 |
| Zeros | 883 |
| Zeros (%) | 0.1% |
| Negative | 1007389 |
| Negative (%) | 58.7% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | -42060 |
|---|---|
| 5-th percentile | -2262 |
| Q1 | -1138 |
| median | -330 |
| Q3 | 474 |
| 95-th percentile | 2623 |
| Maximum | 31199 |
| Range | 73259 |
| Interquartile range (IQR) | 1612 |
Descriptive statistics
| Standard deviation | 4994.2197 |
|---|---|
| Coefficient of variation (CV) | 9.7826638 |
| Kurtosis | 28.180286 |
| Mean | 510.51736 |
| Median Absolute Deviation (MAD) | 806 |
| Skewness | 5.1271334 |
| Sum | 8.2237966 × 108 |
| Variance | 24942230 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 883 | 0.1% |
| 3 | 845 | < 0.1% |
| -7 | 837 | < 0.1% |
| 1 | 830 | < 0.1% |
| -14 | 787 | < 0.1% |
| -10 | 782 | < 0.1% |
| 4 | 777 | < 0.1% |
| -2 | 772 | < 0.1% |
| -1 | 771 | < 0.1% |
| -42 | 768 | < 0.1% |
| Other values (14086) | 1602823 | |
| (Missing) | 105553 | 6.1% |
| Value | Count | Frequency (%) |
| -42060 | 1 | < 0.1% |
| -42056 | 1 | < 0.1% |
| -42042 | 3 | |
| -42041 | 1 | < 0.1% |
| -42013 | 1 | < 0.1% |
| -41938 | 1 | < 0.1% |
| -41920 | 1 | < 0.1% |
| -41910 | 1 | < 0.1% |
| -41905 | 1 | < 0.1% |
| -41899 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 31199 | 1 | < 0.1% |
| 31198 | 89 | |
| 31197 | 63 | |
| 31196 | 50 | < 0.1% |
| 31195 | 103 | |
| 31194 | 122 | |
| 31193 | 150 | |
| 31192 | 110 | |
| 31191 | 108 | |
| 31190 | 90 |
DAYS_ENDDATE_FACT
Real number (ℝ)
High correlation  Missing 
| Distinct | 2917 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 633653 |
| Missing (%) | 36.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1017.4371 |
| Minimum | -42023 |
|---|---|
| Maximum | 0 |
| Zeros | 64 |
| Zeros (%) | < 0.1% |
| Negative | 1082711 |
| Negative (%) | 63.1% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | -42023 |
|---|---|
| 5-th percentile | -2393 |
| Q1 | -1489 |
| median | -897 |
| Q3 | -425 |
| 95-th percentile | -94 |
| Maximum | 0 |
| Range | 42023 |
| Interquartile range (IQR) | 1064 |
Descriptive statistics
| Standard deviation | 714.01068 |
|---|---|
| Coefficient of variation (CV) | -0.70177375 |
| Kurtosis | 9.409193 |
| Mean | -1017.4371 |
| Median Absolute Deviation (MAD) | 513 |
| Skewness | -0.77475387 |
| Sum | -1.1016555 × 109 |
| Variance | 509811.22 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -329 | 811 | < 0.1% |
| -273 | 794 | < 0.1% |
| -301 | 791 | < 0.1% |
| -91 | 785 | < 0.1% |
| -154 | 783 | < 0.1% |
| -84 | 783 | < 0.1% |
| -182 | 782 | < 0.1% |
| -238 | 778 | < 0.1% |
| -210 | 778 | < 0.1% |
| -350 | 773 | < 0.1% |
| Other values (2907) | 1074917 | |
| (Missing) | 633653 |
| Value | Count | Frequency (%) |
| -42023 | 1 | |
| -3042 | 1 | |
| -2922 | 1 | |
| -2919 | 1 | |
| -2917 | 1 | |
| -2916 | 2 | |
| -2915 | 2 | |
| -2914 | 2 | |
| -2913 | 1 | |
| -2912 | 2 |
| Value | Count | Frequency (%) |
| 0 | 64 | < 0.1% |
| -1 | 217 | |
| -2 | 162 | < 0.1% |
| -3 | 223 | |
| -4 | 265 | |
| -5 | 373 | |
| -6 | 369 | |
| -7 | 429 | |
| -8 | 411 | |
| -9 | 414 |
AMT_CREDIT_MAX_OVERDUE
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 68251 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 1124488 |
| Missing (%) | 65.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3825.4177 |
| Minimum | 0 |
|---|---|
| Maximum | 1.1598718 × 108 |
| Zeros | 470650 |
| Zeros (%) | 27.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 14220.452 |
| Maximum | 1.1598718 × 108 |
| Range | 1.1598718 × 108 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 206031.61 |
|---|---|
| Coefficient of variation (CV) | 53.858591 |
| Kurtosis | 245696.92 |
| Mean | 3825.4177 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 470.91382 |
| Sum | 2.2644177 × 109 |
| Variance | 4.2449023 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 470650 | |
| 1440 | 688 | < 0.1% |
| 225 | 405 | < 0.1% |
| 45 | 377 | < 0.1% |
| 4.5 | 315 | < 0.1% |
| 90 | 222 | < 0.1% |
| 4500 | 220 | < 0.1% |
| 2700 | 192 | < 0.1% |
| 9000 | 192 | < 0.1% |
| 5400 | 189 | < 0.1% |
| Other values (68241) | 118490 | 6.9% |
| (Missing) | 1124488 |
| Value | Count | Frequency (%) |
| 0 | 470650 | |
| 0.045 | 17 | < 0.1% |
| 0.09 | 4 | < 0.1% |
| 0.135 | 12 | < 0.1% |
| 0.18 | 5 | < 0.1% |
| 0.225 | 6 | < 0.1% |
| 0.27 | 2 | < 0.1% |
| 0.315 | 8 | < 0.1% |
| 0.36 | 4 | < 0.1% |
| 0.405 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 115987185 | 1 | |
| 94812246 | 1 | |
| 16950010.5 | 1 | |
| 14111390.7 | 1 | |
| 13975258.5 | 1 | |
| 13766418 | 1 | |
| 13144527 | 1 | |
| 11478060 | 1 | |
| 11246044.5 | 1 | |
| 10861812 | 1 |
CNT_CREDIT_PROLONG
Real number (ℝ)
Skewed  Zeros 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0064104058 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 1707314 |
| Zeros (%) | 99.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.096223906 |
|---|---|
| Coefficient of variation (CV) | 15.010579 |
| Kurtosis | 615.43877 |
| Mean | 0.0064104058 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.319277 |
| Sum | 11003 |
| Variance | 0.00925904 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1707314 | |
| 1 | 7620 | 0.4% |
| 2 | 1222 | 0.1% |
| 3 | 191 | < 0.1% |
| 4 | 54 | < 0.1% |
| 5 | 21 | < 0.1% |
| 9 | 2 | < 0.1% |
| 6 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1707314 | |
| 1 | 7620 | 0.4% |
| 2 | 1222 | 0.1% |
| 3 | 191 | < 0.1% |
| 4 | 54 | < 0.1% |
| 5 | 21 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 21 | < 0.1% |
| 4 | 54 | < 0.1% |
| 3 | 191 | < 0.1% |
| 2 | 1222 | 0.1% |
| 1 | 7620 | 0.4% |
| 0 | 1707314 |
AMT_CREDIT_SUM
Real number (ℝ)
Skewed  Zeros 
| Distinct | 236708 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 13 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 354994.59 |
| Minimum | 0 |
|---|---|
| Maximum | 5.85 × 108 |
| Zeros | 66582 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11250 |
| Q1 | 51300 |
| median | 125518.5 |
| Q3 | 315000 |
| 95-th percentile | 1350000 |
| Maximum | 5.85 × 108 |
| Range | 5.85 × 108 |
| Interquartile range (IQR) | 263700 |
Descriptive statistics
| Standard deviation | 1149811.3 |
|---|---|
| Coefficient of variation (CV) | 3.2389545 |
| Kurtosis | 49315.967 |
| Mean | 354994.59 |
| Median Absolute Deviation (MAD) | 93451.5 |
| Skewness | 124.5861 |
| Sum | 6.0931804 × 1011 |
| Variance | 1.3220661 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 66582 | 3.9% |
| 225000 | 57608 | 3.4% |
| 135000 | 50195 | 2.9% |
| 450000 | 37156 | 2.2% |
| 90000 | 36940 | 2.2% |
| 180000 | 28840 | 1.7% |
| 45000 | 26570 | 1.5% |
| 67500 | 25444 | 1.5% |
| 270000 | 22467 | 1.3% |
| 675000 | 20581 | 1.2% |
| Other values (236698) | 1344032 |
| Value | Count | Frequency (%) |
| 0 | 66582 | |
| 0.45 | 80 | < 0.1% |
| 2.565 | 1 | < 0.1% |
| 4.5 | 546 | < 0.1% |
| 9 | 10 | < 0.1% |
| 13.5 | 13 | < 0.1% |
| 14.13 | 1 | < 0.1% |
| 18 | 3 | < 0.1% |
| 21.645 | 1 | < 0.1% |
| 22.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 585000000 | 1 | < 0.1% |
| 396000000 | 1 | < 0.1% |
| 170100000 | 1 | < 0.1% |
| 164032200 | 1 | < 0.1% |
| 146958507 | 1 | < 0.1% |
| 142290000 | 1 | < 0.1% |
| 135000000 | 3 | |
| 132750000 | 6 | |
| 112500000 | 1 | < 0.1% |
| 106745400 | 1 | < 0.1% |
AMT_CREDIT_SUM_DEBT
Real number (ℝ)
High correlation  Missing  Skewed  Zeros 
| Distinct | 226537 |
|---|---|
| Distinct (%) | 15.5% |
| Missing | 257669 |
| Missing (%) | 15.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137085.12 |
| Minimum | -4705600.3 |
|---|---|
| Maximum | 1.701 × 108 |
| Zeros | 1016434 |
| Zeros (%) | 59.2% |
| Negative | 8418 |
| Negative (%) | 0.5% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | -4705600.3 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 40153.5 |
| 95-th percentile | 628902.45 |
| Maximum | 1.701 × 108 |
| Range | 1.748056 × 108 |
| Interquartile range (IQR) | 40153.5 |
Descriptive statistics
| Standard deviation | 677401.13 |
|---|---|
| Coefficient of variation (CV) | 4.9414636 |
| Kurtosis | 5673.4343 |
| Mean | 137085.12 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 36.414538 |
| Sum | 1.9997415 × 1011 |
| Variance | 4.5887229 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1016434 | |
| 4.5 | 653 | < 0.1% |
| -450 | 543 | < 0.1% |
| 135000 | 344 | < 0.1% |
| 90000 | 320 | < 0.1% |
| 45000 | 316 | < 0.1% |
| 22500 | 307 | < 0.1% |
| 67500 | 238 | < 0.1% |
| 225000 | 237 | < 0.1% |
| 13500 | 205 | < 0.1% |
| Other values (226527) | 439162 | |
| (Missing) | 257669 | 15.0% |
| Value | Count | Frequency (%) |
| -4705600.32 | 1 | |
| -3109510.98 | 1 | |
| -2796723.72 | 1 | |
| -2273021.73 | 1 | |
| -2167229.34 | 1 | |
| -2089184.31 | 1 | |
| -2014753.455 | 1 | |
| -1764858.06 | 1 | |
| -1354875.615 | 1 | |
| -1093553.235 | 1 |
| Value | Count | Frequency (%) |
| 170100000 | 1 | |
| 164032200 | 1 | |
| 65441403 | 1 | |
| 64570243.5 | 1 | |
| 62218953 | 1 | |
| 59637690 | 1 | |
| 51750000 | 1 | |
| 51365155.5 | 1 | |
| 47406861 | 1 | |
| 44968383 | 1 |
AMT_CREDIT_SUM_LIMIT
Real number (ℝ)
Missing  Zeros 
| Distinct | 51726 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 591780 |
| Missing (%) | 34.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6229.515 |
| Minimum | -586406.11 |
|---|---|
| Maximum | 4705600.3 |
| Zeros | 1050142 |
| Zeros (%) | 61.2% |
| Negative | 351 |
| Negative (%) | < 0.1% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | -586406.11 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5736.0667 |
| Maximum | 4705600.3 |
| Range | 5292006.4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 45032.031 |
|---|---|
| Coefficient of variation (CV) | 7.2288182 |
| Kurtosis | 796.09609 |
| Mean | 6229.515 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.026914 |
| Sum | 7.0060116 × 109 |
| Variance | 2.0278839 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1050142 | |
| 135000 | 2178 | 0.1% |
| 4500 | 1474 | 0.1% |
| 45000 | 1335 | 0.1% |
| 90000 | 974 | 0.1% |
| 13500 | 833 | < 0.1% |
| 22500 | 766 | < 0.1% |
| 225000 | 757 | < 0.1% |
| 67500 | 678 | < 0.1% |
| 450000 | 558 | < 0.1% |
| Other values (51716) | 64953 | 3.8% |
| (Missing) | 591780 |
| Value | Count | Frequency (%) |
| -586406.115 | 1 | |
| -401346.945 | 1 | |
| -399166.875 | 1 | |
| -372598.245 | 1 | |
| -316391.895 | 1 | |
| -255704.355 | 1 | |
| -250138.845 | 1 | |
| -234151.065 | 1 | |
| -223123.05 | 1 | |
| -216620.01 | 1 |
| Value | Count | Frequency (%) |
| 4705600.32 | 1 | |
| 4500000 | 2 | |
| 4443255 | 1 | |
| 3555065.655 | 1 | |
| 3375000 | 2 | |
| 3109510.98 | 1 | |
| 3037500 | 1 | |
| 2801223.72 | 1 | |
| 2700000 | 2 | |
| 2648700 | 1 |
AMT_CREDIT_SUM_OVERDUE
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 1616 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.912758 |
| Minimum | 0 |
|---|---|
| Maximum | 3756681 |
| Zeros | 1712270 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 3756681 |
| Range | 3756681 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5937.65 |
|---|---|
| Coefficient of variation (CV) | 156.61351 |
| Kurtosis | 211836.85 |
| Mean | 37.912758 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 403.24186 |
| Sum | 65074519 |
| Variance | 35255688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1712270 | |
| 4.5 | 301 | < 0.1% |
| 9 | 107 | < 0.1% |
| 13.5 | 81 | < 0.1% |
| 18 | 72 | < 0.1% |
| 22.5 | 60 | < 0.1% |
| 45 | 56 | < 0.1% |
| 27 | 52 | < 0.1% |
| 36 | 50 | < 0.1% |
| 31.5 | 48 | < 0.1% |
| Other values (1606) | 3331 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 1712270 | |
| 0.045 | 3 | < 0.1% |
| 0.27 | 3 | < 0.1% |
| 0.315 | 1 | < 0.1% |
| 0.36 | 3 | < 0.1% |
| 0.675 | 1 | < 0.1% |
| 0.72 | 1 | < 0.1% |
| 0.765 | 1 | < 0.1% |
| 0.81 | 1 | < 0.1% |
| 0.855 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3756681 | 1 | |
| 3681063 | 1 | |
| 2387232 | 1 | |
| 1851210 | 1 | |
| 1617403.5 | 1 | |
| 1361214 | 1 | |
| 1329597 | 1 | |
| 1224474.885 | 1 | |
| 1125733.5 | 1 | |
| 1097437.5 | 1 |
CREDIT_TYPE
Categorical
Imbalance 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 116.0 MiB |
| Consumer credit | |
|---|---|
| Credit card | |
| Car loan | 27690 |
| Mortgage | 18391 |
| Microloan | 12413 |
| Other values (10) | 4124 |
Length
| Max length | 44 |
|---|---|
| Median length | 15 |
| Mean length | 13.858992 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Consumer credit |
|---|---|
| 2nd row | Credit card |
| 3rd row | Consumer credit |
| 4th row | Credit card |
| 5th row | Consumer credit |
Common Values
| Value | Count | Frequency (%) |
| Consumer credit | 1251615 | |
| Credit card | 402195 | 23.4% |
| Car loan | 27690 | 1.6% |
| Mortgage | 18391 | 1.1% |
| Microloan | 12413 | 0.7% |
| Loan for business development | 1975 | 0.1% |
| Another type of loan | 1017 | 0.1% |
| Unknown type of loan | 555 | < 0.1% |
| Loan for working capital replenishment | 469 | < 0.1% |
| Cash loan (non-earmarked) | 56 | < 0.1% |
| Other values (5) | 52 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| credit | 1653811 | |
| consumer | 1251615 | |
| card | 402195 | 11.8% |
| loan | 31813 | 0.9% |
| car | 27690 | 0.8% |
| mortgage | 18391 | 0.5% |
| microloan | 12413 | 0.4% |
| for | 2467 | 0.1% |
| business | 1975 | 0.1% |
| development | 1975 | 0.1% |
| Other values (20) | 6388 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 3370683 | |
| e | 2935997 | |
| d | 2058041 | |
| 1694305 | 7.1% | |
| C | 1681556 | 7.1% |
| t | 1677798 | 7.1% |
| i | 1669634 | 7.0% |
| c | 1666716 | 7.0% |
| o | 1334782 | 5.6% |
| n | 1304025 | 5.5% |
| Other values (24) | 4394425 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23787962 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 3370683 | |
| e | 2935997 | |
| d | 2058041 | |
| 1694305 | 7.1% | |
| C | 1681556 | 7.1% |
| t | 1677798 | 7.1% |
| i | 1669634 | 7.0% |
| c | 1666716 | 7.0% |
| o | 1334782 | 5.6% |
| n | 1304025 | 5.5% |
| Other values (24) | 4394425 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23787962 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 3370683 | |
| e | 2935997 | |
| d | 2058041 | |
| 1694305 | 7.1% | |
| C | 1681556 | 7.1% |
| t | 1677798 | 7.1% |
| i | 1669634 | 7.0% |
| c | 1666716 | 7.0% |
| o | 1334782 | 5.6% |
| n | 1304025 | 5.5% |
| Other values (24) | 4394425 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23787962 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 3370683 | |
| e | 2935997 | |
| d | 2058041 | |
| 1694305 | 7.1% | |
| C | 1681556 | 7.1% |
| t | 1677798 | 7.1% |
| i | 1669634 | 7.0% |
| c | 1666716 | 7.0% |
| o | 1334782 | 5.6% |
| n | 1304025 | 5.5% |
| Other values (24) | 4394425 |
DAYS_CREDIT_UPDATE
Real number (ℝ)
High correlation 
| Distinct | 2982 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -593.74832 |
| Minimum | -41947 |
|---|---|
| Maximum | 372 |
| Zeros | 605 |
| Zeros (%) | < 0.1% |
| Negative | 1715806 |
| Negative (%) | > 99.9% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | -41947 |
|---|---|
| 5-th percentile | -2079 |
| Q1 | -908 |
| median | -395 |
| Q3 | -33 |
| 95-th percentile | -8 |
| Maximum | 372 |
| Range | 42319 |
| Interquartile range (IQR) | 875 |
Descriptive statistics
| Standard deviation | 720.74731 |
|---|---|
| Coefficient of variation (CV) | -1.2138936 |
| Kurtosis | 596.37366 |
| Mean | -593.74832 |
| Median Absolute Deviation (MAD) | 372 |
| Skewness | -11.334995 |
| Sum | -1.0191262 × 109 |
| Variance | 519476.69 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -7 | 18503 | 1.1% |
| -8 | 18462 | 1.1% |
| -11 | 16975 | 1.0% |
| -15 | 16870 | 1.0% |
| -12 | 16827 | 1.0% |
| -10 | 16651 | 1.0% |
| -9 | 16546 | 1.0% |
| -13 | 16387 | 1.0% |
| -6 | 16281 | 0.9% |
| -14 | 16210 | 0.9% |
| Other values (2972) | 1546716 |
| Value | Count | Frequency (%) |
| -41947 | 1 | |
| -41946 | 2 | |
| -41945 | 1 | |
| -41943 | 2 | |
| -41940 | 1 | |
| -41936 | 2 | |
| -41934 | 2 | |
| -41933 | 1 | |
| -41931 | 1 | |
| -41926 | 1 |
| Value | Count | Frequency (%) |
| 372 | 1 | < 0.1% |
| 23 | 2 | |
| 22 | 1 | < 0.1% |
| 20 | 2 | |
| 19 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 4 | |
| 12 | 1 | < 0.1% |
AMT_ANNUITY
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 40321 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 1226791 |
| Missing (%) | 71.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15712.758 |
| Minimum | 0 |
|---|---|
| Maximum | 1.1845342 × 108 |
| Zeros | 256915 |
| Zeros (%) | 15.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 13.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 13500 |
| 95-th percentile | 46571.4 |
| Maximum | 1.1845342 × 108 |
| Range | 1.1845342 × 108 |
| Interquartile range (IQR) | 13500 |
Descriptive statistics
| Standard deviation | 325826.95 |
|---|---|
| Coefficient of variation (CV) | 20.736459 |
| Kurtosis | 58560.694 |
| Mean | 15712.758 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 212.54312 |
| Sum | 7.6935475 × 109 |
| Variance | 1.061632 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 256915 | 15.0% |
| 4500 | 5182 | 0.3% |
| 13500 | 3147 | 0.2% |
| 22500 | 2502 | 0.1% |
| 9000 | 1725 | 0.1% |
| 18000 | 1605 | 0.1% |
| 45000 | 1593 | 0.1% |
| 27000 | 1252 | 0.1% |
| 2700 | 1208 | 0.1% |
| 6750 | 1164 | 0.1% |
| Other values (40311) | 213344 | 12.4% |
| (Missing) | 1226791 |
| Value | Count | Frequency (%) |
| 0 | 256915 | |
| 0.045 | 62 | < 0.1% |
| 0.315 | 1 | < 0.1% |
| 0.45 | 75 | < 0.1% |
| 1.395 | 1 | < 0.1% |
| 1.44 | 2 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 2.16 | 1 | < 0.1% |
| 3.375 | 1 | < 0.1% |
| 3.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 118453423.5 | 1 | |
| 90632371.5 | 1 | |
| 59586682.5 | 1 | |
| 57476227.5 | 1 | |
| 56844981 | 1 | |
| 54562657.5 | 1 | |
| 45490630.5 | 1 | |
| 43286215.5 | 1 | |
| 42812998.2 | 1 | |
| 33784668 | 1 |
Interactions
Correlations
| AMT_ANNUITY | AMT_CREDIT_MAX_OVERDUE | AMT_CREDIT_SUM | AMT_CREDIT_SUM_DEBT | AMT_CREDIT_SUM_LIMIT | AMT_CREDIT_SUM_OVERDUE | CNT_CREDIT_PROLONG | CREDIT_ACTIVE | CREDIT_CURRENCY | CREDIT_DAY_OVERDUE | CREDIT_TYPE | DAYS_CREDIT | DAYS_CREDIT_ENDDATE | DAYS_CREDIT_UPDATE | DAYS_ENDDATE_FACT | SK_ID_BUREAU | SK_ID_CURR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AMT_ANNUITY | 1.000 | 0.056 | 0.255 | 0.379 | 0.120 | 0.018 | 0.017 | 0.000 | 0.046 | 0.020 | 0.000 | 0.187 | 0.269 | 0.284 | 0.004 | -0.012 | -0.002 |
| AMT_CREDIT_MAX_OVERDUE | 0.056 | 1.000 | 0.114 | -0.004 | 0.012 | 0.066 | 0.063 | 0.006 | 0.055 | 0.066 | 0.023 | -0.204 | -0.070 | -0.047 | -0.114 | -0.010 | -0.002 |
| AMT_CREDIT_SUM | 0.255 | 0.114 | 1.000 | 0.446 | -0.001 | 0.010 | -0.019 | 0.000 | 0.026 | 0.010 | 0.006 | 0.126 | 0.401 | 0.313 | 0.181 | 0.003 | 0.001 |
| AMT_CREDIT_SUM_DEBT | 0.379 | -0.004 | 0.446 | 1.000 | 0.090 | 0.050 | 0.021 | 0.013 | 0.038 | 0.049 | 0.071 | 0.461 | 0.609 | 0.645 | 0.031 | 0.008 | -0.001 |
| AMT_CREDIT_SUM_LIMIT | 0.120 | 0.012 | -0.001 | 0.090 | 1.000 | -0.004 | 0.145 | 0.023 | 0.079 | -0.005 | 0.023 | 0.090 | 0.172 | 0.188 | 0.016 | -0.003 | -0.000 |
| AMT_CREDIT_SUM_OVERDUE | 0.018 | 0.066 | 0.010 | 0.050 | -0.004 | 1.000 | 0.002 | 0.028 | 0.000 | 0.934 | 0.005 | 0.016 | 0.026 | 0.038 | -0.006 | -0.002 | 0.001 |
| CNT_CREDIT_PROLONG | 0.017 | 0.063 | -0.019 | 0.021 | 0.145 | 0.002 | 1.000 | 0.022 | 0.000 | 0.002 | 0.035 | -0.029 | 0.068 | 0.026 | 0.008 | -0.001 | -0.000 |
| CREDIT_ACTIVE | 0.000 | 0.006 | 0.000 | 0.013 | 0.023 | 0.028 | 0.022 | 1.000 | 0.008 | 0.080 | 0.239 | 0.290 | 0.121 | 0.002 | 0.000 | 0.003 | 0.001 |
| CREDIT_CURRENCY | 0.046 | 0.055 | 0.026 | 0.038 | 0.079 | 0.000 | 0.000 | 0.008 | 1.000 | 0.000 | 0.042 | 0.025 | 0.006 | 0.000 | 0.000 | 0.001 | 0.001 |
| CREDIT_DAY_OVERDUE | 0.020 | 0.066 | 0.010 | 0.049 | -0.005 | 0.934 | 0.002 | 0.080 | 0.000 | 1.000 | 0.002 | 0.012 | 0.022 | 0.035 | -0.007 | -0.002 | 0.001 |
| CREDIT_TYPE | 0.000 | 0.023 | 0.006 | 0.071 | 0.023 | 0.005 | 0.035 | 0.239 | 0.042 | 0.002 | 1.000 | 0.071 | 0.314 | 0.013 | 0.000 | 0.007 | 0.002 |
| DAYS_CREDIT | 0.187 | -0.204 | 0.126 | 0.461 | 0.090 | 0.016 | -0.029 | 0.290 | 0.025 | 0.012 | 0.071 | 1.000 | 0.742 | 0.744 | 0.874 | 0.011 | 0.000 |
| DAYS_CREDIT_ENDDATE | 0.269 | -0.070 | 0.401 | 0.609 | 0.172 | 0.026 | 0.068 | 0.121 | 0.006 | 0.022 | 0.314 | 0.742 | 1.000 | 0.810 | 0.881 | 0.012 | 0.001 |
| DAYS_CREDIT_UPDATE | 0.284 | -0.047 | 0.313 | 0.645 | 0.188 | 0.038 | 0.026 | 0.002 | 0.000 | 0.035 | 0.013 | 0.744 | 0.810 | 1.000 | 0.872 | 0.019 | 0.000 |
| DAYS_ENDDATE_FACT | 0.004 | -0.114 | 0.181 | 0.031 | 0.016 | -0.006 | 0.008 | 0.000 | 0.000 | -0.007 | 0.000 | 0.874 | 0.881 | 0.872 | 1.000 | 0.016 | -0.001 |
| SK_ID_BUREAU | -0.012 | -0.010 | 0.003 | 0.008 | -0.003 | -0.002 | -0.001 | 0.003 | 0.001 | -0.002 | 0.007 | 0.011 | 0.012 | 0.019 | 0.016 | 1.000 | 0.000 |
| SK_ID_CURR | -0.002 | -0.002 | 0.001 | -0.001 | -0.000 | 0.001 | -0.000 | 0.001 | 0.001 | 0.001 | 0.002 | 0.000 | 0.001 | 0.000 | -0.001 | 0.000 | 1.000 |
Missing values
Sample
| SK_ID_CURR | SK_ID_BUREAU | CREDIT_ACTIVE | CREDIT_CURRENCY | DAYS_CREDIT | CREDIT_DAY_OVERDUE | DAYS_CREDIT_ENDDATE | DAYS_ENDDATE_FACT | AMT_CREDIT_MAX_OVERDUE | CNT_CREDIT_PROLONG | AMT_CREDIT_SUM | AMT_CREDIT_SUM_DEBT | AMT_CREDIT_SUM_LIMIT | AMT_CREDIT_SUM_OVERDUE | CREDIT_TYPE | DAYS_CREDIT_UPDATE | AMT_ANNUITY | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 215354 | 5714462 | Closed | currency 1 | -497 | 0 | -153.0 | -153.0 | NaN | 0 | 91323.00 | 0.00 | NaN | 0.0 | Consumer credit | -131 | NaN |
| 1 | 215354 | 5714463 | Active | currency 1 | -208 | 0 | 1075.0 | NaN | NaN | 0 | 225000.00 | 171342.00 | NaN | 0.0 | Credit card | -20 | NaN |
| 2 | 215354 | 5714464 | Active | currency 1 | -203 | 0 | 528.0 | NaN | NaN | 0 | 464323.50 | NaN | NaN | 0.0 | Consumer credit | -16 | NaN |
| 3 | 215354 | 5714465 | Active | currency 1 | -203 | 0 | NaN | NaN | NaN | 0 | 90000.00 | NaN | NaN | 0.0 | Credit card | -16 | NaN |
| 4 | 215354 | 5714466 | Active | currency 1 | -629 | 0 | 1197.0 | NaN | 77674.5 | 0 | 2700000.00 | NaN | NaN | 0.0 | Consumer credit | -21 | NaN |
| 5 | 215354 | 5714467 | Active | currency 1 | -273 | 0 | 27460.0 | NaN | 0.0 | 0 | 180000.00 | 71017.38 | 108982.62 | 0.0 | Credit card | -31 | NaN |
| 6 | 215354 | 5714468 | Active | currency 1 | -43 | 0 | 79.0 | NaN | 0.0 | 0 | 42103.80 | 42103.80 | 0.00 | 0.0 | Consumer credit | -22 | NaN |
| 7 | 162297 | 5714469 | Closed | currency 1 | -1896 | 0 | -1684.0 | -1710.0 | 14985.0 | 0 | 76878.45 | 0.00 | 0.00 | 0.0 | Consumer credit | -1710 | NaN |
| 8 | 162297 | 5714470 | Closed | currency 1 | -1146 | 0 | -811.0 | -840.0 | 0.0 | 0 | 103007.70 | 0.00 | 0.00 | 0.0 | Consumer credit | -840 | NaN |
| 9 | 162297 | 5714471 | Active | currency 1 | -1146 | 0 | -484.0 | NaN | 0.0 | 0 | 4500.00 | 0.00 | 0.00 | 0.0 | Credit card | -690 | NaN |
| SK_ID_CURR | SK_ID_BUREAU | CREDIT_ACTIVE | CREDIT_CURRENCY | DAYS_CREDIT | CREDIT_DAY_OVERDUE | DAYS_CREDIT_ENDDATE | DAYS_ENDDATE_FACT | AMT_CREDIT_MAX_OVERDUE | CNT_CREDIT_PROLONG | AMT_CREDIT_SUM | AMT_CREDIT_SUM_DEBT | AMT_CREDIT_SUM_LIMIT | AMT_CREDIT_SUM_OVERDUE | CREDIT_TYPE | DAYS_CREDIT_UPDATE | AMT_ANNUITY | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1716418 | 433007 | 5057708 | Closed | currency 1 | -1389 | 0 | -1299.0 | -1299.0 | 0.0 | 0 | 334158.435 | 0.0 | 0.0 | 0.0 | Consumer credit | -1299 | NaN |
| 1716419 | 352790 | 5057718 | Closed | currency 1 | -1808 | 0 | -1596.0 | -1625.0 | 8100.0 | 0 | 28248.840 | 0.0 | 0.0 | 0.0 | Consumer credit | -1625 | NaN |
| 1716420 | 352790 | 5057725 | Closed | currency 1 | -99 | 0 | -83.0 | -98.0 | NaN | 0 | 27000.000 | 0.0 | 0.0 | 0.0 | Consumer credit | -18 | NaN |
| 1716421 | 375755 | 5057734 | Closed | currency 1 | -1335 | 0 | -1152.0 | -1152.0 | NaN | 0 | 195408.000 | 0.0 | NaN | 0.0 | Consumer credit | -1139 | NaN |
| 1716422 | 375755 | 5057742 | Closed | currency 1 | -2648 | 0 | 31129.0 | -189.0 | NaN | 0 | 202500.000 | 0.0 | NaN | 0.0 | Credit card | -109 | NaN |
| 1716423 | 259355 | 5057750 | Active | currency 1 | -44 | 0 | -30.0 | NaN | 0.0 | 0 | 11250.000 | 11250.0 | 0.0 | 0.0 | Microloan | -19 | NaN |
| 1716424 | 100044 | 5057754 | Closed | currency 1 | -2648 | 0 | -2433.0 | -2493.0 | 5476.5 | 0 | 38130.840 | 0.0 | 0.0 | 0.0 | Consumer credit | -2493 | NaN |
| 1716425 | 100044 | 5057762 | Closed | currency 1 | -1809 | 0 | -1628.0 | -970.0 | NaN | 0 | 15570.000 | NaN | NaN | 0.0 | Consumer credit | -967 | NaN |
| 1716426 | 246829 | 5057770 | Closed | currency 1 | -1878 | 0 | -1513.0 | -1513.0 | NaN | 0 | 36000.000 | 0.0 | 0.0 | 0.0 | Consumer credit | -1508 | NaN |
| 1716427 | 246829 | 5057778 | Closed | currency 1 | -463 | 0 | NaN | -387.0 | NaN | 0 | 22500.000 | 0.0 | NaN | 0.0 | Microloan | -387 | NaN |